Semi-Supervised Noisy Label Learning for Chinese Medical Named Entity Recognition
نویسندگان
چکیده
This paper describes our approach for the Chinese clinical named entity recognition (CNER) task organized by 2020 China Conference on Knowledge Graph and Semantic Computing (CCKS) competition. In this task, we need to identify boundary category labels of six entities from electronic medical record (EMR). We constructed a hybrid system composed semi-supervised noisy label learning model based adversarial training rule post-processing module. The core idea is reduce impact data noise optimizing results. Besides, used rules correct three cases redundant labeling, missing wrong labeling in prediction Our method proposed achieved strict criteria 0.9156 relax 0.9660 final test set, ranking first.
منابع مشابه
Semi-supervised Named Entity Recognition in noisy-text
Many of the existing Named Entity Recognition (NER) solutions are built based on news corpus data with proper syntax. These solutions might not lead to highly accurate results when being applied to noisy, user generated data, e.g., tweets, which can feature sloppy spelling, concept drift, and limited contextualization of terms and concepts due to length constraints. The models described in this...
متن کاملA Semi-supervised Learning Approach to Arabic Named Entity Recognition
We present ASemiNER, a semisupervised algorithm for identifying Named Entities (NEs) in Arabic text. ASemiNER does not require annotated training data, or gazetteers. It also can be easily adapted to handle more than the three standard NE types (Person, Location, and Organisation). To our knowledge, our algorithm is the first study that intensively investigates the semi-supervised pattern-based...
متن کاملSemi-supervised Bio-named Entity Recognition with Word-Codebook Learning
We describe a novel semi-supervised method called WordCodebook Learning (WCL), and apply it to the task of bionamed entity recognition (bioNER). Typical bioNER systems can be seen as tasks of assigning labels to words in bioliterature text. To improve supervised tagging, WCL learns a class of word-level feature embeddings to capture word semantic meanings or word label patterns from a large unl...
متن کاملSemi-Supervised Learning of Named Entity Substructure
The goal of this project was two-fold: (1) to provide an algorithm to correctly find and label named entities in text, and (2) to uncover substructure in the named entities (such as a first name, last name distinction among person entities). The underlying algorithm used is a Class Hidden Markov Model (CHMM), a Hidden Markov Model with hidden states that emit observed words as well as observed ...
متن کاملChinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Named entity recognition (NER) plays an important role in the NLP literature. The traditional methods tend to employ large annotated corpus to achieve a high performance. Different with many semi-supervised learning models for NER task, in this paper, we employ the graph-based semi-supervised learning (GBSSL) method to utilize the freely available unlabeled data. The experiment shows that the u...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data intelligence
سال: 2021
ISSN: ['2096-7004', '2641-435X']
DOI: https://doi.org/10.1162/dint_a_00099